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I. INTRODUCTION 

It is commonly accepted that Quantum Mechanics de- 
scribes the entire "physical world" . In particular, equi- 
librium Statistical Mechanics, and possibly also its limi- 
tations, should follow from Quantum Mechanics, though 
a really satisfying "derivation" still does not seem avail- 
able. The main objective of the present paper is such a 
derivation of the "summit" flj of equilibrium Statistical 
Mechanics, namely the canonical ensemble, from Quan- 
tum Mechanics in combination with certain, very weak 
assumptions regarding the preparation, the observables, 
and the Hamiltonian of the system. 

A key issue in Statistical Physics is the lack of knowl- 
edge about many "details" of any given "real" system. 
In particular, the initial condition of the typically 10 23 
particles is unknown apart from a few "gross" features, 
for instance the (approximate) total energy and a few 
additional macroscopic properties in case of an non- 
equilibrium initial condition. The standard way to deal 
with this incomplete knowledge is to consider a statis- 
tical ensemble (many repetitions of the "same" experi- 
ment, formally described by a density operator) instead 
of one particular "realization" of the experiment (for- 
mally described by either a pure or a mixed quantum 
mechanical state) . Much less appreciated is the fact that 
the concrete statistical ensemble for any given "real" sys- 
tem is also largely unknown and would be at least as 
hard to actually determine as the state of any single re- 
alization. It is therefore unavoidable to introduce certain 
postulates regarding this largely unknown statistical en- 
semble. Their only justification is their plausibility, an 
admittedly subjective concept. For instance, the present 
author finds it not plausible to assume for an isolated 
macroscopic system at equilibrium a microcanonical en- 
semble, be it as a postulate per se or as a consequence 
of some other hypothesis or principle (e.g. Jaynes prin- 
ciple): Its key claim is that if one would be able to actu- 
ally determine the ensemble averaged occupation prob- 
abilities of all energy levels, one would always find that 
all levels within some small energy-interval are occupied 
with exactly equal probabilities and all other levels are 
not occupied at all. Since level populations are constants 
of motion, the same properties must apply to the pos- 
sibly far from equilibrium initial ensemble. This seems 
indeed very unlikely to be true for any given "real" sys- 



tem, independently of any further "details" of the setup 
and the preparation. For instructive numerical examples 
see e.g. Ref. Q- 

Put differently, we find it plausible that there exists 
a well-defined statistical ensemble describing the initial 
state of any given "real" system. But its details are un- 
known, probably very complicated, and already quite dif- 
ferent even when the "same" experiment is repeated in 
two different labs. Hence the chance that the unknown 
"true" ensemble happens to agree with any particular en- 
semble we are postulating is virtually zero. Our present 
solution of this problem is not to assume any specific form 
of the "true" initial ensemble, but only some very gen- 
eral "gross" features. Namely, we will assume that the 
ensemble averaged occupation probabilities of the energy 
levels can be locally (on the energy axis) averaged in a 
well-defined manner. We will provide good reasons to 
expect that real systems satisfy the assumption by closer 
inspection of the preparation procedure at the origin of 
the initial condition. 

A second important deficit of knowledge regards the 
appropriate Hamiltonian (and Hilbert space) of the sys- 
tem, generating the time evolution of the initial ensemble. 
To begin with, we will take it for granted that the system 
can be treated as strictly closed (isolated, autonomous), 
though in real systems small remnant interactions with 
the rest of the world are unavoidable. The main rea- 
son is that standard Quantum Mechanics is only able to 
describe the evolution in time of closed systems. Open 
systems, interacting and entangled with an environment, 
can be handled only "indirectly" , by first including the 
entire relevant environment of the open system into a 
closed supersystem, then evolving the latter by standard 
Quantum Mechanics and finally eliminating the environ- 
ment again. Likewise, we assume that after including all 
relevant perturbations "from outside" into the consid- 
ered system, it must be possible to theoretically model 
it as strictly isolated from the rest of the world. If this 
were not possible, the problem could only be treated by 
means of a generalization of Quantum Mechanics. Many 
(often hidden) hypotheses of how small external pertur- 
bations may modify standard Quantum Mechanics have 
been proposed. We do not share the viewpoint that such 
extensions are a pivotal point in the foundation of Statis- 
tical Mechanics. Otherwise, an unavoidable consequence 
would be that Statistical Mechanics does not "work" for 



strictly closed systems. Also numerical evidence seems 
to support our present viewpoint. 

Taking for granted a closed system, the "right" Hamil- 
tonian (and Hilbert space) is still far from obvious, and 
even the existence of one particular "true" Hamiltonian 
for any given "real" system is questionable. Our present 
way to deal with this problem is to focus on "generic" 
Hamiltonians, while leaving any further details unspeci- 
fied. 

The Quantum Mechanical time evolution generated by 
the Hamiltonian will not be touched in any way, neither 
by heuristically modify it to account for small remnant 
external perturbations (see above), nor by introducing 
any kind of approximation. In other words, the well- 
known time-inversion invariance of Quantum Mechanics 
is fully and rigorously maintained. 

Within the above general framework, the main sub- 
ject of our present work is the long-time evolution of a 
largely arbitrary and possibly far from equilibrium initial 
ensemble. Specifically, we address the two key claims 
of Statistical Mechanics in this context: (i) Equilibra- 
tion: The ensemble approaches a stationary long-time 
behavior, (ii) Thermalization: Provided the ensemble 
exhibits a sharply peaked energy distribution, the long- 
time steady state is captured by (i.e. is experimentally 
indistinguishable from) the microcanonical ensemble cor- 
responding to the given energy peak. In particular, if 
the total closed system consist of a subsystem of actual 
interest which is weakly coupled to a much "bigger" en- 
vironment (canonical setup) , then the steady state of the 
subsystem alone (after eliminating/tracing out the envi- 
ronment) is captured by the canonical ensemble. 

A satisfactory derivation of these cornerstones of Sta- 
tistical Mechanics from Quantum Mechanics is a long- 
standing and still unsolved fundamental problem |3l4l3j. 
and has recently attracted considerable renewed interest 
in the context of (almost) integrable many-body quan- 
tum systems @, [l4| - tl6| . 

In this work we address the questions in how far and 
in which sense equilibration and thermalization can be 
derived within our above specified general framework 
with particular emphasis on the canonical setup. The 
main new results of our present paper are established 
in Sects. IIXI and IXII demonstrating that in the long- 
time limit, the reduced state of the "small" system is 
for all practical purposes indistinguishable from a canon- 
ical ensemble. More precisely, the true ensemble itself 
remains time-dependent forever and thus quite different 
from the canonical density operator, but the experimen- 
tally observable differences between the two ensembles 
are unresolvably small for the overwhelming majority of 
times. The other sections of the present paper provide 
the pre-requisites needed in those central Sects. IIXI and 
IXTl They mostly collect and unify, but partially also ex- 
tend previously known material. The final section IXIII 
contains the summary and outlook with particular em- 
phasis on closely related recent works and on the subject 
of (almost) integrable many-body quantum systems, be- 



ing at the focus of the present Special Issue. 



II. GENERAL FRAMEWORK 

According to Sect. 1, we consider an isolated system, 
incorporating all relevant parts of the environment (ther- 
mal baths, reservoirs etc.), and modeled according to 
standard Quantum Mechanics by an autonomous Hamil- 
tonian H on a. (separable) Hilbert space H. Specifi- 
cally, we focus on spatially finite (compact) systems with 
a large (macroscopic) but finite particle number, corre- 
sponding of / degrees of freedom with 



K / < oo 



(1) 



As a consequence, all eigenvectors of H represent bound 
states and hence the spectrum of H is discrete (quan- 
tized). As usual, \n) (n = 0, 1, ...) denote the (typically 
infinitely many) eigenvectors of H and the corresponding 
eigenvalues E n are assumed to be ordered, 



E <E 1 <E 2 < 



(2) 



with a finite ground state energy Eq > — oo [T3| - I n other 
words, the Hamiltonian can be written as 



H:=Y,E n \n)(n\ 



(3) 



where ^~J n indicates a summation over all n = 0, 1, .... 

For any function g : R — » R we adopt the common 
definition 



g(H):=Y,9(En)\n)(n\ 



(4) 



In the special case of a power series, g(x) = J^k 9kX k , one 
readily sees that the dU reproduces J2k 9kH k , as it must 
be. But (U) also covers more general functions g(x). 

Degenerate energies correspond to equality signs in ([2} . 
Yet, sums like in ^ and (@| are meant to run over all 
n- values. 

For any energy eigenvalue E n , the projector onto the 
associated eigenspace of H is given by 



Pp 



E m = E n 



(5) 



where J^e =e indicates a summation over all m-values 
satisfying E m = E n . It follows that Pg m = Pe„ for de- 
generate energies E m = E n . Consequently, the identity 
operator 



1 W :=£>)<T 



can be rewritten as 



E„ 



(6) 



(7) 



where ^2 E indicates a summation over all mutually dif- 
ferent -E„-values, i.e. degenerate energies only appear 
once in the sum. Likewise, the Hamiltonian from ([3]) can 
be rewritten as 



H = Y,E n P En 



(8) 



A. Level Counting, Entropy, Temperature 

In the following, we collect some "well-know" results of 
"elementary level counting" . Though some details may 
be strictly speaking more subtle than we will say below, 
and possibly even not yet proven rigorously in sufficient 
generality, our present point of view is that these un- 
solved issues of Statistical Physics are substantially less 
critical than those at the actual focus of our present pa- 
per (equilibration and thermalization, see Sect. IJ). Dif- 
ferently speaking, we will henceforth restrict ourselves to 
Hamiltonians H which satisfy the properties given be- 
low and adopt the common opinion that they cover most 
cases of practical relevance. 

We emphasize that the entire present section exclu- 
sively deals with properties of the energy eigenvalues of 
the Hamiltonian Q. In other words, we do not speak 
about system states at all, and hence the considerations 
of the present section do not depend on whether the sys- 
tem is in or out of equilibrium, since these are specifica- 
tions of the system state, not of the system per se. 

To begin with, the number of energy levels below any 
given upper limit E is defined as 



Q(E) :=^2@{E-E n ) 



(9) 



where Q(x) := J_ dyS(y) is the Heaviside step- 
function. The entropy then follows as 



S(E) :=k B \nn(E) 



(10) 



where k B is Boltzmann's constant. (The more common 
definition S(E) := k B [\nil{E) - \nil{E - AE)} with a 
small but finite AE is well-known to be equivalent to (fTU)) 
but would be less convenient later on.) Focusing on the 
most common case, the entropy is an extensive quantity. 
For a system with / degrees of freedom, S(E)/k B is thus 
very roughly speaking comparable in order of magnitude 
to/, 



S(E)/k B = O(f) 



(11) 



It follows from © and (fTTj) that in macroscopic sys- 
tems with / = O(10 23 ) degrees of freedom the energy 
levels are unimaginably dense on any decent energy scale: 
for instance, within an energy interval of 1C)( -10 ' Joule 
there will still be of the order of 10^ 10 > levels. On these 
exceedingly tiny scales, the step function Q(x) appearing 
in © is considered to be actually "washed out" , so that 



il(E) becomes a reasonably smooth function of E with a 
well defined derivative 



;(E):=n'(E) = Yj(E-E n ) 



(12) 



where also the delta-function S(x) — ®'(x) is considered 
as "washed out" over many energy levels. In other words, 
ui(E) represents the density of states. The correspond- 
ingly washed out entropy (fTOj) gives rise to the usual def- 
inition of temperature, 



Combining 
lation 



T{E) := 1/S'(E) . (13) 

(USD, and JT5J, ([13} yields the useful re- 



k B T(E) = Q(E)/oj(E) 



(14) 



We reiterate that in our present work, entropy and tem- 
perature are by definition given by HU\) and i!3\) , and as 
such are for the moment completely independent of the 
question of whether the considered system is at equilib- 
rium or not. In particular, we did not establish any re- 
lation so far between the energy E and the state of the 
system. 

The following statements can be readily verified for 
simple examples like the ideal gas. More detailed esti- 
mates, which we omit here, indicate that they in fact 
remain true quite generally: The actual dependence of 
the right hand side of Eq. (fTTj) on E is such that the 
entropy approaches zero for E 1 Eq, but for the rest (i.e. 
for macroscopic values of E — Eq), the dependence on 
E is comparably weak, essentially of logarithmic form. 
Likewise, (|13j) approaches zero for E \. Eq. Combining 
all these properties of S(E) with (TTTJ) we can conclude 
that 



k B T(E) = O 

and hence 

T(E + AE) =T(E) 



E — Eq 

~1~ 



1 + 



AE 

E — Er 



(15) 



(16) 



Note that if we did not avoid speaking about equilibrium 
states, then (|T5|) could also be justified via the equipar- 
tition of energy for an extensive systems with energy E 
at equilibrium. 

Finally, we can infer from (|15p and the derivative of 
this relation in combination with (1131) that 



S"(E) k B T'(E) 



k B [k B T{E)\ 



= o 



I 



[E - E f 



(17) 



Similarly as in the first paragraph of this section, we 
may take the alternative point of view that we only con- 
sider model Hamiltonians H which capture the follow- 
ing common property of real systems reasonably well: 
If the macroscopic system energy E is changed by an 



amount dE then the microscopic kinetic energy per de- 
gree of freedom kgT(E)/2 changes by an amount which 
is very roughly of the order of magnitude of dE/ f, i.e. 



k B T'{E) = 0(1/ f) 



(18) 



Since T(Eq) = 0, integration of (fl8j) implies (TT5J) and 
(IT71) . Similarly, the third law S(E ) = and (H3J) yield 
upon another integration the relation (fTTj) and the log- 
arithmic dependence of S(E) upon E — Eq mentioned 
above (fT5l) . 

It is well-known that all these relations may become 
problematic for extremely low temperatures. Such cases 
are tacitly excluded from now on. 



B. System States and Dynamics 

According to standard Quantum Mechanics, the state 
of the system is at any time instant t given by a den- 
sity operator p(t). While we are mainly interested in 
statistical ensembles (mixed states) in this paper, it is 
nevertheless worth to point out that formally our consid- 
erations will also cover pure states as special case. The 
time evolution can be written as 

P (t) = U t p(0)U} (19) 

with unitary propagator 

U t := exp{-iHt/h} = ^2exp{-iE n t/H}\n){n\ (20) 

n 

where we have exploited ((H) in the last relation. Denoting 
the matrix elements of p(t) by 



Pmn(t) •— (m\p(t)\n) 



(21) 



Eqs. (JT9J) and (|20|) yield for an arbitrary initial condition 
p(0) at time to = the result 

Pit) =Y,P^(0)^ l[Em ~ E " ]t/h \m)(n\ , (22) 



where ^2 mn indicates a summation over all m, n 
0,1,2,.... 



C. Level populations 

Level populations are denoted by p n and refer to the 
ensemble averaged occupation probabilities of the energy 
eigenstates \n). In other words, p n is the expectation 
value of the observable |n)(n|. According to (|2"T|) and 
(f2"2"|) it can be rewritten as 

p n := Tr{\ri)(n\ p(t)} = p nn (t) = p nn (0) (23) 

independently of t. 

Since every density operator p(t) is, for any value of 
t, non-negative and Hermitian, it follows that (ip, <j)) :— 



(i/j\p(t) + e\4>) satisfies the scalar product axioms for any 
e > 0. Hence, Cauchy-Schwarz's inequality applies, i.e. 
KV^)! 2 < (</>, 4>) (V'iVO- I n the limit e — > we thus 
obtain 



\Prnn{t)\ 2 < Pmm(t) pnn(t) = PnPn 



(24) 



Similarly as in (|23|) , the ensemble averaged occupation 
probability pe„ of an energy eigenvalue E n is given by 
the expectation value of the projector $5§ onto the cor- 
responding eigenspace and can be rewritten as 

PE„ ■= Tr{-F*E„ p(t)} = ^ Pmm{t) = ^ Pm ' 

E m =E n E m =E n 

independently of t. For convenience, also the pE n will 
sometimes be called level populations. 
The obvious normalization conditions are 



1 = Trp(t) = ^2pnn(t) ^^Pn^^PEr, 



(26) 



Since PE n p(0)PE n commutes with H from ([8]), we can 
without loss of generality, choose a basis in which both 
operators are simultaneously diagonal. In the following, 
we always work with this specific energy basis due to its 
convenient property that all the non-diagonal elements 
of PE n p{Q)PE n vanish, 



Pmn(0) = if m ^ n and E m = E n 



(27) 



Given any ensemble p(t), let V.+ C H be the sub- 
Hilbert space spanned by those basis vectors \n) for which 
PE n ^0, 



n, 



span{|n) \p En > 0} . 



(28) 



Exploiting (|2"4"|) it follows that p nm {t) — whenever 
Pnnit) — or p m m(t) = 0. Denoting by P+ the projector 
onto %+ we thus can conclude that 



p(t):=P+p(t)P + 



D. Observables 



(29) 



As usual, observables are represented by Hermitian op- 
erators 

A = ^A m „|m)H (30) 

mn 

A mn := (m\A\n) (31) 

with expectation value 

(A)(t):=Tr{p(t)A] (32) 

and, without loss of generality, are assumed not to de- 
pend explicitly on time. 

According to (j2"9")l it follows that 



(A)(t):=Tr{p(t)A + } 



(33) 



where A + is the projection/restriction of A onto H.+ from 



A, 



P+AP+ 



(34) 



In other words, only the sub-Hilbert space % + and 
the projected/restricted observables (jM)) actually mat- 
ter. Hence we can from now on replace W and A by TA.+ 
and A + whenever it will be convenient. 



III. THE PROBLEM OF EQUILIBRATION 

Generically, the statistical ensemble p{t) is not station- 
ary right from the beginning, in particular for an initial 
condition p(0) out of equilibrium. But if the right hand 
side of (|22l) depends on t initially, it cannot approach 
for large t any time-independent "equilibrium ensemble" 
whatsoever. In fact, any mixed state p(t) returns arbi- 
trarily "near" to its initial state p(0) for certain, suffi- 
ciently large time-points t, and similarly for the expec- 
tation values (|32[) . as demonstrated for instance in Ap- 
pendix D of Ref. [l8(. We emphasise, that these arbi- 
trarily close recurrences do not refer to pure states only 
(as in the classical Poincare recurrences) but rather to 
arbitrary statistical ensembles p(t). 

More specifically, consider any p(t) which is not com- 
pletely independent of t. Then, according to ([22]) there 
must exists at least one p mn (0) ^ with cj := [E n — 
£ m ]/S ^ 0. In fact, one expects that one usually finds 
pairs with w-values ranging from extremely small to ex- 
tremely large values on the scale of 1 Hz, thus including 
any experimentally "reasonable" frequency. Focusing on 
the specific observable 

A = A + A*, A:=\m){n\/p mn {0) (35) 

it readily follows from ([22)) that 

Tr{p(t)A} = 2 cos(wf) . (36) 

In other words, the ensemble p(t) exhibits permanent 
oscillations rather than equilibration, at least as far as 
the observable A is concerned. 

The main implication of the two previous paragraphs 
is that equilibration, as specified in Sect. HI cannot be 
true and hence cannot be proven in full generality and 
rigor. Put differently, Quantum Mechanics and equilibra- 
tion are strictly speaking incompatible. Equilibration can 
at most approximately hold true for a restricted class of 
observables A and initial conditions p(0). The main ob- 
jective of our present work is to show that, and in which 
sense this is indeed the case under rather weak restric- 
tions regarding observables and initial conditions. Those 
are the subject of the next two Sections. 



IV. REALISTIC OBSERVABLES 

The basic idea is that it is not necessary to theoreti- 
cally admit any arbitrary Hermitian operator A as a pos- 



sible observable [19H22J. Rather it is sufficient to focus 
on experimentally realistic observables in the following 
sense [23]: Any observable A must represent an experi- 
mental device with a finite range of possible outcomes of 
a measurement: 



A A := m.a,^(ip\A\ip) 



- xaijx{ij)\A\'il)) = a ri 

r~L 



(37) 



where the maximization and minimization is over all nor- 
malized vectors \ip) € %. Accordingly, a max and a m i n are 
the largest and smallest eigenvalues of A. 

Moreover we require that this working range Aa of the 
device A is limited to experimentally reasonable values 
compared to its resolution limit SA. All measurements 
known to the present author yield less than 20 relevant 
digits, i.e. A A /SA < 10 20 . Maybe some day 100 or 1000 
relevant digits will become feasible, but it seems reason- 
able that a theory which does not go very much beyond 
that will do. We also remark that range and resolution 
are specific to the given measurement device, but are 
(practically) independent of the properties (e.g. the size) 
of the observed system. 

The above specified class of admissible observables 
clearly includes any realistic measurement apparatus. Yet 
it turns out that the class of observables which will be 
admissible in our main results below can still be substan- 
tially extended in two steps. 

First, as said at the end of Sect. Ill Dt we can replace 
the full Hilbcrt space H by the sub-Hilbert space T-L+ 
defined in (|2"5)) . Accordingly, the full range A^ from (f3~7| 
can be replaced by the reduced range 



A' A ■- max(^|A|V>) - mm(t/;\A\^} 

rL-\- rL-\- 



(38) 



According to (f2"8")l . 1~L + is at most as large as H. How- 
ever, in many cases the level populations (25) may be 
safely negligible e.g. beyond some finite upper energy 
threshold, yielding a finite-dimensional W+, while H is 
typically infinite dimensional. Hence, the reduced range 
A^ from (|38p will be finite even for operators A with an 
unbound spectrum on "H, i.e. for which the full range A^ 
from (|37p is infinite. 

Second, we consider observables of the form 



B(b) 



K \n)(n\ 



(39) 



with arbitrary real coefficients b := (bo, &i, ...). In partic- 
ular, we can conclude from (f5]) that arbitrary functions 
g : R — > M of H are of this form, 



g{H) = B(b) if b n := g(E n ) 



(40) 



As it will turn out, it is sufficient to consider instead 
of A in (|38[) any observable of the form A — B(h) with 
arbitrary coefficients b. As a consequence, A^ from 
can be replaced by 



A" 



min{max('0|j4 — B(h)\ip) 

b 'H + 

-min(<0L4-£(b)|<0)} 



(41) 



where mirib indicates a minimization over all real coeffi- 
cients b := (bo, &i, ...). In particular, it follows that 

A£ = if A = B(b) or A = g(H) (42) 

for some set of coefficients b or some function g(x). 
From [|3TP. (1381) . and gl]) we see that 



(43) 



A'i<A' A < A A 



Rather than requiring that the full range-to-resolution 
ratio does not exceed 10 20 , as discussed below (|37]l . it 
will be sufficient in our central result below to similarly 
limit the reduced ratio: 



A^/SA < 10 2 



(44) 



V. REALISTIC INITIAL CONDITIONS 

A. Population density 

Our key requirement with respect to the initial condi- 
tion p(0) is that the concomitant ensemble averaged level 
populations (1231) can be written in the form 



p n = h(E n ) + 6p n 



(45) 



with a smooth function h(E) and "unbiased fluctuations" 
Sp n . Physically, h(E) thus represents a locally averaged 
level occupation probability, henceforth abbreviated as 
population density. 

To be more precise with respect to (145[) . we recall that 
for a system with / degrees of freedom, there are roughly 
lO ^- 1 energy eigenvalues E n per Joule, see Sect. Ill Al 
Assumption (|45[) means that within any energy inter- 
val around some reference energy E > Eq, which con- 
tains very many levels E n , but which is still exceedingly 
small on any experimentally resolvable scale, the ensem- 
ble averaged level populations p n can be split into an 
approximately constant "local" average value h(E) and 
"unbiased fluctuations" Sp n , i.e. the average over all 6p n 
belonging to this interval around E is negligibly small 
compared to h(E) itself. The key point is that h(E) 
must be independent of the exact choice of the consid- 
ered energy interval around E. 

Comparable assumptions of well-defined "local aver- 
ages" are tacitly taken for granted in many different phys- 
ical contexts. Likewise, we find it quite plausible that the 
ensemble averaged level populations p n , though largely 
unknown (see Sect. [I]), still satisfy our present assump- 
tion under experimentally realistic conditions. Further 
arguments are provided in Sect. IV Bl below. 

Finally, we emphasise once more that all these con- 
siderations concern ensemble averaged level populations, 
i.e. mean values over many repetitions of an experiment, 
which the experimentalist would denote as "identical" 
but which in fact are very different on the microscopic 
level, see Sect. HI 



B. System preparation 

The key idea is that the initial condition is the result 
of a preparation process, during which the system was 
not yet isolated, admitting conclusions about the initial 
condition itself. 

The simplest case consists in a time-dependent para- 
metric change of the Hamiltonian during the preparation 
phase. More complex, but ultimately applying to every 
real experiment, is some type of contact with the "rest 
of the world" prior to the actual isolation of the system. 

The first consequence is an entanglement with the rest 
of the world during the preparation phase (t < 0), im- 
plying that the reduced initial state (at t = 0) of the 
system (after tracing out the rest of the world) will be 
a mixed state even for a single realization of the experi- 
ment. Already this reduction step brings along a certain 
"randomization" of the system level populations. More 
importantly, there will unavoidably arise some kind of 
time dependencies of the system Hamiltonian during the 
preparatory period t < 0, the last of them being caused 
by the actual shutting down of all connections with the 
rest of the world. Such a time dependence of the system 
Hamiltonian is known to generically entail an approxi- 
mately diffusive "spreading" of occupation probabilities 
over neighboring energy levels [25j. Since the levels are 
so exceedingly dense, the diffusion will - already during 
a very short time span and even for a very weak time- 
dependence of the Hamiltonian - effectively lead to a 
diffusive randomization of the p„'s in accordance with 

In theoretical studies it is quite common to gener- 
ate the out of equilibrium initial condition by means 
of a "sudden" (discontinuous) parametric change of the 
Hamiltonian 0, [la, LL6( , called "quantum quench" . Such 
a procedure thus misses the above mentioned diffusive 
"spreading" of the occupation probabilities. 



C. Energy density 

The energy probability density, or energy density for 
short, is defined as 



p(E) := (S(E H)) 



(46) 



Accordingly, p(E) dE quantifies the ensemble averaged 
probability to find a value between E and E + dE when 
measuring the energy of the system. 

With g]), dH]), flit]), and @ it follows that 



(47) 



p(E) =5>„ S(E - E n ) 



independently of t. In the same spirit as around (1121) and 
in the discussion of h(E) below (|45l) . the delta-functions 
in (|4*rJ)) and P?) are understood to be "washed out" over 
many energy levels in order to give rise to a well-defined, 



smooth energy density. As a consequence one finds that 
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E. Physical arguments 



p(E) = h(E) ui{E) . 



(48) 



While a detailed derivation of this relation is provided in 
Appendix C , it is intuitively quite obvious: The prob- 
ability p{E) dE to encounter an energy between E and 
E + dE is equal to the locally averaged population h(E) 
of the energy levels multiplied by the local level density 
uj(E) times the interval length dE. 



D. Maximal level population 

To get a feeling for the exotic orders of magnitudes 
arising in the context of (|45|) . let us assume that there 
are exactly lO^ 10 > equally spaced energy levels E n per 
Joule and that our energy interval around E has a length 
between 10~( 10 )j and a few J. Then, our interval con- 
tains at least roughly lO ^ 10 ) energy levels. Assuming 
that h(E) is approximately constant within the inter- 
val, and zero outside, the normalization (|26[) implies that 
h(E) = Kr ' 1023 ) within the interval. Recalling that 
this is the local average value of p n , it seems quite rea- 
sonable to assume that all the individual p n values do not 
exceed the range between zero and 10^ 10 ) times the av- 
erage value h(E). Otherwise, the average over the Spn's 
would not be negligible compared to h(E) = lCP ' 10 ) 
for every possible choice of the interval. 

Returning to the general case, we can conclude that 
even if h(E) varies very fast on any experimentally re- 
alistic scales and even if the energy levels are populated 
extremely unequally, we still expect that max„ p n will be 
extremely small, typically 



max p r , 



io-°w . 



(49) 



On this rather heuristic level, (jlS"]) thus implies (|49|) . In- 
tuitively it even seems plausible that the two conditions 
are more or less equivalent. 

Next we remark that the mere existence of the level 
density f| 12[) implicitly takes for granted that the multi- 
plicities of degenerate energies are not exceedingly large, 
i.e. very much smaller than 10°^. There can be little 
doubt that this assumption will be fulfilled under all ex- 
perimentally realistic conditions. Under the very same 
assumption we can infer from (|25[) and (|49p the very 
rough estimate 



maxp B = 10~ o(/) . 



(50) 



Note that pE n is the occupation probability of E n from 
(f25j). and as such does not refer to any specific energy 
basis. In contrast, (I49|) is implicitly understood with re- 
spect to the specific basis introduced above (|2"T1) . This is 
the main advantage of (jSTJ)) compared to (l4"9)) . 



Besides those already discussed in Sect. IV Bl there are 
the following additional physical reasons to expect that 
(|49| and (|50|) are fulfilled under experimentally realistic 
circumstances. 

First, the time-energy uncertainty relation seems to 
prohibit for all practical purposes the determination of 
the system energy with a precision that would be neces- 
sary to populate only a relatively small number of levels 
with appreciable probability so that (|4TJ)) and (jSTJ]) would 
be violated 

Second, while an ideal energy measurement would in 
principle allow us to prepare the system at one specific 
energy eigenvalue, every real (finite resolution) measure- 
ment will result in appreciable probabilities of very many 
levels. 



F. Example ([35]) 

It is instructive to reconsider our example from (|35l) . 
(1361) and see what happens to the concomitant incom- 
patibility with equilibration in case we restrict ourselves 
to realistic observables and initial conditions, satisfying 
(|4"4"|) and (14T)1) . respectively. To begin with, one readily 
sees that the spectrum of A from (|3"5|) (within H+) con- 
sists of the two eigenvalues a± — ±|p rlTO (0)| _1 , and, in 
case diniH+ > 2, of one further eigenvalue ao — 0. With 
(EMI) . (|3T|) and ([38]) we can conclude that 



Aa = A4 = 2|o±| > 2/maxp n 



(51) 



A somewhat more tedious calculation shows that also 
A A ' from (|4ip coincides with Aa in our present exam- 
ple. For experimentally realistic initial conditions we can 
infer with (j^)) that A^ > O(IO^). For macroscopic sys- 
tems (/ 3> 1) it follows that the oscillations from (l3"6l are 
beyond any realistic experimental resolution limit SA ac- 
cording to P4"|) . The same thing may alternatively also 
be viewed as follows: Any single (ideal) measurement 
process always results in one of the three outcomes a+ , 
a_, or ao- Hence an infeasible number of repetitions is 
needed to resolve the order-one variations of the ensemble 
average (|3"6"1) . 

In short, while mathematically speaking the observ- 
able (l3"5j) indeed leads to perpetual oscillations ([3"fJf , such 
oscillations cannot be resolved in practice for experimen- 
tally realistic observables and initial conditions. 

The above example also suggests that our assumptions 
of experimentally realistic observables and initial condi- 
tions (or some similar restrictions) are almost unavoid- 
able for taming the oscillations in (|22|) and thus over- 
coming the concomitant incompatibility with the basic 
Statistical Mechanical claim of equilibration, see Sect. 

mi 
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VI. GENERIC HAMILTONIANS 

As detailed in Sect. HI the "true" Hamiltonian H of a 
given system is usually not known in detail. Therefore, 
we assume that these details are of "generic" character 
in so far as the level counting properties from Sect. Ill Al 
are satisfied and energy differences Ej — E k and E n — E m 
are never exactly equal apart from trivial cases. More 
precisely, we require that 



If Ej ^ E k and E m ^ E n 
then Ej -E k = E n - E m 
implies Ej — E n and Ek = E n 



(52) 



A condition similar to (1521) is well known under the 
names "non-resonance condition" or "non-degenerate en- 
ergy gap condition" and is considered to be satisfied by 
generic Hamiltonians, see e.g. [g, [a, [l(J, LL3j , an d> m P ar ~ 
ticular, Sect. 3.2.1 in [2g] and references therein. The 
essential intuitive argument is as follows: Consider an ar- 
bitrary "path" H(X) in the "space of all Hamiltonians", 
parameterized by A. In the absence of any special rea- 
sons like symmetries, it is quite plausible that every gap 
E n — E m evolves as a function of A somewhat differently 
than all the other gaps. While we cannot exclude that 
two gaps may happen to coincide for specific A- values, 
these special points are of measure zero. In other words, 
Hamiltonians with degenerate energy gaps are of measure 
zero compared to "all" Hamiltonians. 

We remark that our present condition is weaker than 
the usual non-resonance condition [g, [9j, llOL [13, 26] in so 
far as \52\) still admits the possibility of degenerate energy 
eigenvalues. 



VII. EQUILIBRATION FOR ISOLATED 

SYSTEMS 



Being confident that the above discussed conditions 
(JHJ, l[5U]). and dSU are fulfilled under many, if not all, 
experimentally realistic conditions, we henceforth take 
them for granted and turn to the question, in how far 
they are sufficient to yield equilibration, i.e. a stationary 
long time behavior of the statistical ensemble (cf. Sect. 

19. 



A. Equilibrium ensemble 

Given an arbitrary but fixed p(0) evolving according to 
(|22[) . we will see below that the pertinent equilibrium en- 
semble is given by the density operator (sometimes called 
the generalized Gibbs ensemble) 



where the time average of an arbitrary function or oper- 
ator h(t) is defined as 



h(t) := lim — / dt h(t) . 



(54) 



In other words, the equilibrium ensemble p eq is the time 
averaged "true" ensemble pit). As such, it is time- 
independent and moreover inherits all the defining prop- 
erties of a genuine density operator from pit). Namely, 
one readily sees that p eq is a non-negative, Hermitian 
operator of unit trace and satisfies the dynamics (|22l) . 

Making use of the specific basis introduced above ([2T]) , 
one can conclude - as detailed in Appendix D - from (|23|) 
and ([52]) that 



Pe 



^2p,in{0) \n)(n\ = ^Pn \n) (n\ 



(55) 



i.e., p e g amounts to the (time-independent) diagonal part 
of pit) from f2"2"j). 

Focusing on observables of the specific form ([39]) it 
follows with 1221) and (E3 that 



(56) 
(57) 



Tr{p(t)B(h)} = Tv{p eq B(b)}. 
In particular, we can conclude with (J40)) that 

Tr{p(t)g(H)}=Tr{p eq g(H)}. 
for arbitrary functions g : R — > K. 

B. Main result 

It readily follows from ([32]), ([53]), and flM]) that 



(A)(t)=Tv{ Peq A} 



(58) 



In other words, on the average over all times t > 0, the 
"true" statistical ensemble p(t) is indistinguishable from 
the equilibrium ensemble p eq . 

The natural next step is to consider the mean square 
deviation 



ai:=[(A)(t)-(A)(tW. 
The following relation is derived in Appendix D: 



a\ < (A^) 2 Tr{p%} 



(59) 



(60) 



where A^ 



i' 1 "-"* Tr IPe ? } 



is defined in flU]). The last 
in ([60]) is the so-called purity of p eq , i.e. the purity of 
the time-independent part of p(t), but not the purity of 
p(t) itself. In principle, pit) may even be a pure state 
(see above (I19[) ) with a purity of one, while the purity 
of the concomitant p eq may still be as small as lO^ '-^ 
according to ([50]) and the relation (|132j) in Appendix B. 
Observing that Tr{p 2 ? } = J2 n /°nn(0) according to ([55]) 
and introducing relation (|132[) from Appendix B into (pO")) 
we finally obtain 



Pe 



P(t) 



(53) 



al < (A^) 2 m&xp En 



(61) 



where ps n is the occupation probability of E n , see (|25|) . 
Considering Tr{p(i)^4} as a random variable, gener- 
ated by randomly sampling time points t according to a 
uniform distribution on [0, oo), the corresponding mean 
value and variance are given by ([58]) and (l59l) . The 
next step is to invoke Chebyshev's inequality [la, l22j> 
stating that for any random variable x with average p 
and variance a 2 and any given k > 0, the probability 
Prob(|ir — jj\ > k) that x deviates from // by more than k 
satisfies Prob(|x — p\ > n) < (ct/k) 2 . In our present case 
we thus can conclude that 



Prob \Tr{p(t)A} - Tr{p eq A}\ > SA < 



,(62) 



where SA is the resolution limit of A, see Sect. IIV1 With 
(|6ip we arrive at the first main result of our present pa- 
per: 



Prob(\Tr{p(t)A} 



Tr{p eq A}\ >SA 



< 



SA 



maxp£; n (|33) 



where pe„ is the ensemble averaged occupation proba- 
bility of the (possibly degenerate) energy eigenvalue E n , 
see (|25|) . We recall that the only ingredients in deriv- 
ing this result were the (generalized) non-resonance con- 
dition (I52|) and the assumption that the measurement 
range A% from (l4*Tj) is finite, cf. Sect [IVJ For the rest, 
(|6"3"| is a completely general and rigorous relation, for- 
mally valid for any choice of SA > 0. It generalizes the 
previously known result from [12], which did not admit 
degenerate energy eigenvalues, nor the minimization over 
arbitrary B(h) in (|4l"j). 



C. Discussion 

For realistic initial conditions and generic Hamiltoni- 
ans we can take for granted the rough estimate (1ST " 
yielding with (|M)l the result 



Prob \\Tr{p(t)A} - Tv{p eq A}\ > SA 



< 



SA 



10-°^(fi4) 



Focusing on large systems ([T]) we can conclude that for 
the overwhelming majority of times t > the difference 
between Tr{p(t)A} and Tr{p eq A} is way below the in- 
strumental resolution limit SA for any experimentally re- 
alistic observable according to (J44I) . In other words, the 
system looks exactly as if it were in the steady state en- 
semble p eq for the overwhelming majority of times t > 0, 
though the "true" density operator p(t) is actually quite 
different, see Sect. IIII1 This is the main result of our 
present paper regarding the question of equilibration, see 
Sect. ED 

Note that these conclusions do not really require a 
macroscopic number / of degrees of freedom. Put dif- 
ferently, our result also explains the common numerical 
observation that already quite small particle numbers of- 
ten equilibrate and thermalize surprisingly well. 



As promised in the introduction, the derivation of (f6Tj) - 
(|M|) is based on the exact Quantum Mechanical time 
evolution (|19[) -(|22 |) without any modification or approx- 
imation. In other words, the full Quantum Mechanical 
time-inversion invariance is still contained in (1641) . In 
particular, (|64p is compatible with the recurrence prop- 
erty of Tr{/?(£)^4} mentioned below (|M|) . but implies that 
such excursions from the "apparent equilibrium state" 
p eq must be exceedingly rare events. 

Exactly the same "apparent equilibration" towards p eq 
emerges if one propagates p(0) backward in time (keeping 
the system isolated also for t < 0). Along the entire 
real i-axis, an initial condition p(0) far from equilibrium 
thus closely resembles one of the above mentioned rare 
excursions, except that the location of this excursion is 
on purpose chosen as the time-origin. 

In other words, Quantum Mechanical time inversion 
invariance is maintained, but when starting out of equilib- 
rium, an "apparent time arrow" emerges with extremely 
high fidelity. 

Note that any single excursion of Tr{p(t)A} from the 
"apparent equilibrium value" Ti{p eq A} is a priori not 
expected to exhibit any special symmetry with respect 
to time inversion. Only the probabilistic properties of 
an ensemble of such excursions are, in the absence of 
magnetic fields, expected to satisfy a microreversibility or 
detailed balance type of symmetry with respect to time 
inversion. Note that these considerations apply both to 
"small" and "large" excursions. 

While (|6"l)l provides a bound for the relative amount 
of time the system exhibits notable deviations from equi- 
librium, the typical duration of one given excursion, or 
equivalently, the characteristic relaxation time of an out 
of equilibrium initial condition p(0) remains unspecified. 
Note that Statistical Mechanics itself also makes not 
statements in this respect. Hence, it is justified to omit 
them within a foundation of Statistical Mechanics. How- 
ever, we remark that since our assumptions on initial 
condition and Hamiltonian were very weak and we kept 
the exact Quantum Mechanical time evolution (TK)|) - (|2"2"1) . 
we expect that the actual relaxation time will be close 
to that of the real system we are modeling, provided this 
modeling is not too bad. Since one can easily imagine real 
systems with arbitrarily large or small relaxation times, 
any further quantification of the relaxation process in- 
evitably would require a considerably more detailed spec- 
ification of the Hamiltonian H, the initial state p(0), and 
the observable A. Thus, our main result (IM1) may well 
be already quite close to "the maximum one can say in 
full generality" . 

As mentioned above (fl~9|) and below (|60|) . in principle 
p(t) may even be a pure state of the form \ip(t)}(ip(t)\. 
In this case, the occupation probabilities pE n of the en- 
ergy eigenvalues E n appearing in (|63[) can be rewritten 
according to ([23)1 as 



(65) 



PE n = J2 IW*)|m)| s 



E m = E n 
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As long as all these occupation probabilities are small, 
e.g. satisfying the rough estimate ([SOI , the above relation 
([64]) and the subsequent discussion still remain valid for 
pure states p(t) = \ip(t)}(ip(t)\. 

If the system is prepared in a pure energy eigcnstatc, 
i.e. p(f) = p(0) = |n)(n| then A^ = for arbitrary A 
according to ((211) and fil]). In other words, ((Ml) still 
represents a tight upper estimate in this case, which in 
fact represents "the opposite extreme" compared to the 
property (j49f or (f50|) of experimentally realistic initial 
conditions. 

Likewise, for observables of the form (39) or of the form 
g(H) with arbitrary g we have A^ = in ((64[) according 
tog!]). 

We close with the following conceptual remarks re- 
garding the notion of "experimentally realistic observ- 
ables and initial conditions" . In Sects. [TV] and [V] we 
have specified certain properties which we are proposing 
to be necessary for observables or initial conditions to 
be considered as "experimentally realistic". But these 
properties are not meant to be sufficient. While es- 
tablishing such exact (necessary and sufficient) condi- 
tions is not the subject of our present work, we provide 
some simple example to illustrate our point: For any A 
and any fixed t, the observable B := XJ T AXJ\ satisfies 
Tr{p(t)B} = Tr{p(t-T)A} according to ([19]). Whenever 
A was realistic according to the criterion from Sect. IIV1 
the same applies to B (As = Aa). According to ([3"2"]l. B 
imitates (for any p(t)) the behavior of A with a time de- 
lay of r. If p(0) is a far from equilibrium initial condition 
and t exceeds the relaxation time for the time inverted 
dynamics, the observable B thus initially behaves as if 
the system were already equilibrated but then all of a 
sudden undergoes an excursion as if the system would 
transiently move very far from equilibrium. Turning to 
negative r values, B would represent a device which can 
"look" - in principle arbitrarily far - into the future. 
There can be little doubt that such observables are not 
"realistic". Likewise, for any given p(t) satisfying the 
Quantum Mechanical time evolution (fT9]) . a hypotheti- 
cal initial condition of the form p T (0) := p(—t) produces 
analogous "unrealistic" phenomena while being "exper- 
imentally realistic" according to Sect. [Vl whenever p(0) 
was so (the level populations of p(0) and of /5 T (0) are 
equal). To identify suitable criteria for sorting out such 
pathologies is a very subtle task, especially in view of the 
fact that so-called spin-echo experiments seem indeed to 
be able to realize such initial conditions p T (0) to some 
extent HI. 



VIII. THE PROBLEM OF THERMALIZATION 

According to ([55]) and the discussion below ([64]) . ex- 
pectation values (132[) become practically indistinguish- 
able from 



after initial transients have died out. In this respect the 
problem of equilibration raised in Sect. [Jean be consid- 
ered as settled and we henceforth can focus on ([66]) . 

Turning to the issue of thermalization, the key question 
is thus, in how far the equilibrium expectation value of 
A from (I66|) is in agreement with that predicted by the 
microcanonical ensemble, namely 



Tr{p™ c A} = Y,Pn ic A« 



(67) 



where the level populations p™ lc are equal to a normal- 
ization constant if E n is contained within a small energy 
interval 



I:=[E- AE, E] 



(68) 



Tr{p eq A} = y^p n A„ 



(66) 



and zero otherwise [291 ] . 

In case ([66]) and ([67]) yield measurable differences for 
experimentally realistic p(0) and A, the "purely Quan- 
tum Mechanical" prediction (1661) is commonly considered 
as "more fundamental" @, Ha, LL6| ■ From this point of 
view, our derivation of equilibrium Statistical Mechanics 
is complete, provided the latter is valid itself. 

What are these validity conditions, beyond which the 
microcanonical formalism of equilibrium Statistical Me- 
chanics may break down? 

A first well known validity condition for the micro- 
canonical formalism is, as said below (|67|) . that only E n 
within a small energy interval (|68p have a non-vanishing 
occupation probability. More generally, as already men- 
tioned in Sect. H] in equilibrium Statistical Mechanics it 
is taken for granted that the system energy is fixed up 
to unavoidable experimental uncertainties. On the other 
hand, realistic initial conditions according to Sect. [Vlin 
particular require that this energy uncertainty is much 
larger than lCP ^' Joule, which is obviously always ful- 
filled in practice, but we never introduced or exploited 
any type of upper limit for this uncertainty so far, i.e. 
the energy uncertainty may still be arbitrarily large in 
J6'4[ ) and H66]) . In other words, for large energy uncer- 
tainties, our key relation (|66[) remains valid, while equi- 
librium Statistical Mechanics is likely to become invalid. 
This is clearly a not at all surprising case of disagreement 
between (|BBJ and (JBTJ) . 

To avoid such "almost trivial" cases, we henceforth 
take for granted that the system energy is known up to 
an uncertainty AE which is as small as possible, but still 
experimentally realistic, cf. SectfVl 

A second (often tacit) validity condition of the micro- 
canonical formalism is that the expectation values ([6"T]) 
are required/ assumed to be (practically) independent of 
the exact choice of of the interval / in ([6"8"]l . i.e. of its 
upper limit E and its width AE. But essentially this 
means nothing else than: 

In ([66]) the details of p n are largely irrelevant. (69) 

The same conclusion (|69p follows from the equivalence 
of the microcanonical and canonical ensembles (for all 
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energies E), considered as a self-consistency condition 
for equilibrium Statistical Mechanics [20, [21| . 

Clearly, given property (p5j) holds, the expectation val- 
ues (f66|) and (|67)l are indeed practically indistinguishable. 

Our first remark regarding property (J69I) itself, is that 
no experimentalist can control the populations p n of the 
unimaginably dense energy levels E n , apart from the 
very gross fact that they are "mainly concentrated within 
the interval / from ([68])" . If the details would matter, 
not only equilibrium Statistical Mechanics would break 
down, but also reproducing measurements, in particular 
in different labs, would be largely impossible, see also 
the discussion in Sects. Q]and[V] Second, one can readily 
construct observables and initial conditions, being exper- 
imentally realistic according to our definitions in Sects. 
[TV] and E] but still violating §§§. The fact that equilib- 
rium Statistical Mechanics is known to have an extremely 
wide experimental applicability implies that our so far 
notion of "experimentally realistic" is still too general 
(see also at the end of Sect. I VII CI) . 

The simplest way to guarantee property (|69[) seems 
to require/assume that the expectation values A nn = 
(n\A\n) hardly vary within any small energy interval of 
the form ([55)) . This is similar in spirit to classical coarse 
graining, and, in fact, is part of a common conjecture 
about the semiclassical behavior of fully chaotic classical 
systems [30| . In particular, negligible variations of A nn 
for close by n-values imply Serdicki's "eigenstate ther- 
malization hypothesis" [31] (anticipated in [7J and revis- 
ited in Ref. [2j]), implying that each individual energy 
eigenstate \n) behaves like the equilibrium ensemble. 

An alternative way to guarantee property (|69[) follows 
from the argument by Peres [6| that even if the A nn may 
notably vary with n, the immense number of relevant 
summands in (|66|) may - for "typical" A and p(0) - lead 
to a kind of statistical averaging effect and thus a largely 
p(0)-independent overall value of the sum. 

Numerically, the validity and possible failure of such 
conjectures and of property (p5[) itself have been exempli- 
fied e.g. in @, 0, H, [32| ■ While the details - in particular 
the role of "more basic" system properties like "ergodic- 
ity" and "(non-)integrability" - are still not very well 
understood 0, [{J Is [30, [33, [34j , "equilibration" in agree- 
ment with ([66p was seen numerically in all known cases. 



H$, and Hilbert space T-Ls together with an environment 
(e.g. a heat bath, index "B") with fs degrees of free- 
dom, Hamiltonian Hb, and Hilbert space Hb- As usual, 
the environment is assumed to be macroscopic and much 
"bigger" than the system, i.e. 



/s>/s 



(70) 



The system S may or may not be macroscopic, i.e. fs 
may but needs not be a large number (in the range of 
10 23 ). On the other hand fs is of the order of 10 23 or even 
larger (e.g. if fs is already of this order). The system- 
plus-environment compound (total system, supersystem, 
no index) thus has 

/ = fs + Jb (71) 

degrees of freedom and "lives" in the product space 



H:=Hs®H 1 



(72) 



The contact (coupling) between system and environ- 
ment is described by an interaction Hamiltonian Hi nt : 
H —¥ H and a "coupling strength" A, resulting in a total 
Hamiltonian of the form 



H{\) = H S ® 1 Hb + \ ns ®H B + \ Hi, 



(73) 



where 1% S indicates the identity on %s, and similarly for 

We will mainly be interested in observables which only 
concern system properties, i.e. which are of the form 



A = A s ® h 



(74) 



Within this general framework, we take for granted 
that all conditions for equilibration of the isolated 
system-plus-bath compound in the sense of Sect. IVIII 
are fulfilled, i.e. the observables (f74"[) satisfy (|4"4"[) , the 
ensemble averaged energy level populations satisfy ([50p . 
and the Hamiltonian ([73[) satisfies the generalized non- 
resonance condition (J52[) . As expected and demonstrated 
in detail later, these requirements in particular rule out 
A = 0. A further requirement is the subject of the next 
subsection. 



IX. CANONICAL SETUP 

The objective in the remainder of the paper is to estab- 
lish thermalization without making use of the unproven 
property ([r?§[) . While the general case seems extremely 
difficult to tackle, as discussed above, we focus on the 
most important special case, namely the so-called canon- 
ical setup: an isolated compound system, consisting of a 
(sub-)system which is weakly coupled to a "heat bath" . 

In analogy to Sect. [H] the starting point is a sys- 
tem (subsystem, central system, system of actual inter- 
est, index "S") with fs degrees of freedom, Hamiltonian 



A. Weak coupling condition 

Some kind of weak coupling assumption is an indis- 
pensable (though often tacit) prerequisite of the canon- 
ical formalism. The simplest possibility would be to re- 
quire that A in ([73| is so small that the eigenvectors \n) 
and eigenvalues E n of H{\) deviate only very little from 
those of H(0). However, according to ordinary perturba- 
tion theory, these deviations will be governed by terms of 
the form X(m\Hi nt \n} /[E m — E n j. Since some E m — E n 
are of the order of 10 ^^ Joule (see below {11])), the 
admissible A-values would be so small that no realistic 
model would satisfy the condition. 
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For this reason, we henceforth focus on system-plus- 
bath compounds ([73"]) which satisfy the following "op- 
erational" weak coupling condition: After equilibration 
of the system-plus-bath compound, a reversible (adiabat- 
ically slow) decoupling of the system from the bath does 
not lead to any experimentally resolvable changes. 

As far as expectation values of system observables A 
are concerned, we recall that the "true" , time-dependent 
expectation values (A) (t) may, even after equilibration, 
still exhibit quite notable "excursions" at exceedingly 
rare time points t (cf. Sect. IVII| . In this case, the above 
weak coupling condition tacitly refers to time-averaged 
expectation values. 

The quantities, for which the weak coupling condition 
will actually be taken for granted later on, are observables 
of the form (f74|) and the energy density (j46|) of the total 
system-plus-bath compound. 

Physically, there can be little doubt that most real sys- 
tems in contact with a heat bath satisfy the above weak 
coupling condition. Hence, the same is expected for "re- 
alistic models" of such systems. Yet, in view of the per- 
turbation theoretical considerations above, a mathemat- 
ical proof for any given model seems extremely difficult. 
In fact, the condition concerns not only the Hamiltonian 
([73]) but simultaneously the observables (|74|) and initial 
conditions p(0) (cf. Sect. |VJ of the system-plus-bath 
compound. The nature and difficulty of the problem may 
become more evident by considering a particularly "sim- 
ple" special case, namely observables of the form ([74]) 
and canonical density operators. In this special case, we 
may consider the relation 



-H(\)/k B T . . 

Tr<— a[=Tv s " 



Z{\) 



-H s /k B T 



Z s 



■A" 



(75) 



with A from (JT^J), H(X) from ([73]), Tr and Tr s indicat- 
ing the traces over the Hilbert spaces H and Hs m ([72]) . 
respectively, and Z(X) and Z$ representing standart par- 
tition sums, normalizing the respective density operators. 
This relation (1751) is an elementary identity for A = 0, is 
usually considered as "obvious" for "weak coupling" , but 
to the best of the present authors knowledge is unproven 
(and probably wrong and thus unprovable without strong 
extra assumptions on A ) for small but still experimen- 
tally realistic coupling strengths A. 

Since the solution of these long standing and very sub- 
tle problems is not the actual main theme of our present 
work, we adopt the standpoint that the above weak cou- 
pling condition is an implicit additional requirement re- 
garding experimentally realistic Hamiltonians, observ- 
ables, and initial conditions on top of the previous re- 
quirements ([44]) . ([50]) . ([52]) . In all what follows, we focus 
on systems which satisfy all those requirements, being 
confident that they include the majority of experimen- 
tally realistic models. 



B. Main implications of weak coupling 

The "true" time-averaged expectation values are, ac- 
cording to (|55|) . given by 



(A)(t)=Tr{p eq (\)A} 



(76) 



where the argument A of p eq has been added to remind 
us of the fact that we are dealing with the given, "true" 
Hamiltonian (73) with a "small" but non- vanishing cou- 
pling strength A. Likewise, the "true" density operator 
from (1221) is now rewritten in the form 



P(t) 



] Pm n(0)e-^- (X »D mn (\) (77) 



,(A) := [E m {\) - E n (\)]/h, D mn {\) := 
|m(A))(n(A)|, and where |n(A)) and E n (\) refer to the 
eigenvectors and eigenvalues of the Hamiltonian (73) for 
an arbitrary but fixed A-value. 

As soon as A in ([73"]) starts to change in the course of 
time, the density operator is no longer given by ([22]) but 
rather follows from the Liouville-von Neumann equation 



thp(t) = [H(X(t)),p(t)} 



(78) 



While an explicit solution for general protocols A(i) is 
hopeless, in the special case of adiabatically slow (quasi- 
static) parameter changes, the Adiabatic Theorem can 
be invoked to yield 

p(t)=Y,P^(0)e- lf ^- {Xis))ds D mn (\(t)) , (79) 



where we have tacitly restricted ourselves to the simplest 
and most relevant case of non-degenerate energy levels, 
see also Appendix E. 

The adiabatically slow decoupling process appearing in 
the weak coupling condition (Sect. ILXAp means that A(t) 
in (|79|) is given during a large but finite initial time span 
by the "true" , finite coupling strength, then adiabatically 
slowly changes to the value zero, and afterwards remains 
zero for all later times t until infinity. Consequently, the 
time average of the density operator in (1791) is governed 
by the infinitely long time period with X(t) = 0, i.e. 



p(*) = I>«»(0)K0)}<n(0)| 



(80) 



Invoking the weak coupling condition from Sect. ILX Al it 
follows that the "true" time-averaged expectation values 
([76]) are practically indistinguishable from those obtained 
with the help of i 



(81) 



(A)(t)=J2Pn{n(0)\A\n(0)) 



where Pn := Pnn {0) = (n(A(0))|p(0)|n(A(0))> are the 
level populations of the "true" system at time t = 0, 
cf. (|23[) . In other words, though the true system-bath 
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coupling strength X is finite, we can formally work in the 
zero coupling limit as far as time-averaged expectation 
values are concerned. 

A well-known further consequence of the Adiabatic 
Theorem (|T5|) is the time- independence of the level popu- 
lations (n(X(t))\p(t)\n(X(t))) which can thus be identified 
with p n = /9 rm (0) for all times t. Considering A rather 
than t as independent variable, we may equivalently say 
that the p n are A- independent. It follows that in the re- 
lation p n = h(E n (X)) + 6p n from (|45|) . the right hand side 
must be A-independent in the same sense. By locally av- 
eraging over many n- values (see below ({35} and [35|) we 
can conclude that also h(E n (X)) must be A-independent. 
In view of the A-dependence of E n (X) it follows that also 
the function h(E) generally must acquire a dependence 
on A. Indicating this fact by adding an index A to h{E) 
we can conclude that 

h x (E n (X)) = h (E n (0)) (82) 

for all n, and that the relation (j45|) now takes the form 

Pn = h X (E n (X)) + S Pn (83) 

with A-independent p n and Sp n . Likewise, the density 
of states (fT2|) and the energy density (l47l) acquire a A- 
dependence and thus are now denoted by cu\(E) and 
Px(E) respectively. Similarly as in (j48|) one can con- 
clude Kja that 



po(E) = ho{E)oj (E) 



(84) 



Finally, the weak coupling condition as discussed in 
Sect. I IX Al includes the practical indistinguishability of 
the "true" energy density from the energy density after 
decoupling the system from the bath, i.e. 



p x (E)=p (E) 



C. Zero coupling limit 



(85) 



We consider the Hamiltonian ((73")) in the limit of van- 
ishing coupling strength A. Denoting by |n)g and £"„ the 
eigenvectors and eigenvalues of Hs and by \m) b and E^\ 
those of Hb , those of (f73l) with A = follow as 



E,, 



\n)s\m) B 

E, n + E^ 



(86) 
(87) 



1. Violation of the non-resonance condition 

The fact that systems consisting of non-interacting 
sub-systems require special attention with respect to the 
generalized non-resonance condition (|52|) has first been 
noticed in Ref. (131. 



Taking into account that the original indices in (|52j) 
now become double indices according to (|87|l. condition 
([52]) takes the modified form 

"■ Ej 1 j 2 T 1 E/ Cl k 2 an °- E mim2 f= E nin2 



then Ei, ,„ — Ej Cl k 2 = E ni7l2 — E„ 



implies Ej i:j 



E„ 



and Ei. 



k\k 2 



E„ 



One readily sees that this condition is violated by con- 
sidering the following specific choice [13(: j\ — ji — 
k\ = U2 =: n and ki = mi = rri2 = n\ =: m. In the 
generic case, it will be possible to find indices n and m 
so that all four energies E nn , E nm , E mn , E mm appear- 
ing in (88) in are different. With ([37]) it follows that 



E,, 



E„ 



In other words, condition 



is violated. 



2. Exploiting the product basis 

The purpose of this subsection is to rewrite the perti- 
nent relations from Sect. IIXBI in terms of the product 
energy basis ([86| and the corresponding energy eigenval- 
ues (|87|) in the zero coupling limit. 

Essentially, we are just relabeling all eigenvectors and 
eigenvalues in a way which will turn out particularly con- 
venient later on. Namely, all the single labels n(X) are 
now replaced by double lables nm(X) with the additional 
convention that the argument A will be omitted in the 
case A = 0, in agreement with the notation in Sect. 
IIXC II Note that only in the zero coupling limit does 
the first of the two indices in nm refer to the system and 
the second to the bath, but not any more for A ^ 0. 

Specifically, |n(0)) and E n (0) from Sect. IIXBI are now 
denoted as \nm) and E nm and satisfy (|86|) and (|87|) . Like- 
wise, \n{X)) and E n (X) are now denoted as \nm(X)} and 
E nm (X). Finally, the A-independent level populations p n 
now become p nm . 

For observables of the form (|74l) the time-averaged ex- 
pectation value (|8T|) thus can be rewritten as 



(89) 



(A)(t) = ^p nm s(n\A s \n) s 

run 

= J2Pns(n\A s \n) s 

n 
Pn := / .Pnm ■ 



" - Y,p™ ■ ( 9 °) 



Next, we rewrite the p nm according to (|53")) as 

Pnm = h x (E nm (X)) + Sp nm (91) 

Regarding the "unbiased fluctuation" Sp nm (cf. Sect. 
IV A| can conclude that for any fixed index n, the sub-set 
o~Pnm with variable indices m = 0,1,2,... is unbiased as 
well, i.e. 



y^JW = },h\(E nm (X)) 



(92) 
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More precisely, the energies E nm from (|87|) . with n ar- 
bitrary but fixed and m variable, are still unimaginably 
dense, and the same property is inherited by E nm (X). 
Further, there is no reason to expect the emergence of 
any special "correlations" between the ensemble averaged 
fluctuations Sp n m by selecting any sub-set with a fixed n 
in (EH). 

Note that during the preparation phase (cf. Sect. 
IV B[) . there may exists an indirect, non-weak coupling 
between system and bath in the canonical setup, namely 
when both parts are interacting simultaneously with the 
rest of the world. Then, it is even more obvious to expect 
that ([92]) will be canonically fulfilled. 

Rewriting (J82J) as h\{E nm {\)) = ho(E nm ) it follows 
with ((901) and d92J) that pi = J2 h {E nm ) and with ([84]) . 
(J5S]) that pi = J2P*( E nm)/uo(E nm )- Dropping indices 
"0" corresponding to A = as usual, we finally arrive at 



P'n 



Z^ 



p(E nm ) 
u(E nm ) 



(93) 



where p{E) stands for the "true" energy density p\{E) 
of the system-plus-bath compound at finite coupling A. 



3. Additivity of Entropy 

Here, we revisit the issues of level counting, entropy 
and temperatures from Sect. Ill Al for the special Hamil- 
tonian (l73|) with A = 0. 

The definitions and relations ([9"T)-(fT7l) can be taken over 
without any change to our present special case, except 
that all single indices n now become double indices nm 
(see (f86]h ((87))). We reiterate that all those definitions 
and relations remain for the moment on a purely formal 
level without any reference to the actual system state. 
Their only purpose at this stage is to count levels in a 
convenient way. 

Since A = in (|73|). we are dealing with two individ- 
ual isolated systems and hence analogous definitions and 
equations as in (|9j)- (fP7|) . but now with indices "S" and 
"B" , apply separately to the system and to the bath. As 
detailed in Appendix A, the following relations between 
those separate system and bath quantities and the orig- 
inal quantities for the system-plus-bath compound (|T3[) 
with A = can be established: Focusing on E > Eo, we 
denote by E B {E) the maximum of S B (E') + S S (E- E') 
with respect to £", i.e. 

E B {E) := ma,x&Tg{S B (E') + S S {E - E')} . (94) 

E' 

In the generic case this maximum is unique and is con- 
tained in the interval (Eq, E). Adopting the definition 

E S {E) := E - E B (E) , (95) 

the following results are established in Appendix A: 

S(E) = Sb(E b (E)) + S s (E s (E)) (96) 

T{E) = T b (E b (E)) = T s (E s (E)) (97) 



More precisely, these relations are asymptotically exact 
approximations for f B — > oo. But since f B is at least of 
the order of 10 23 (see below ((70|)) they are satisfied with 
extremely high accuracy. 

In other words, in the zero coupling limit the en- 
tropies of the systems-plus-bath compound exhibit an ad- 
ditive behavior provided the total energy E is distributed 
among system and bath such that S B (E') + Ss(E — E 1 ) 
is maximized, which in turn has the consequence that all 
temperatures are identical, i.e. the so-called equilibrium 
condition (or zeroth law of thermodynamics) is fulfilled. 



X. OUTLOOK ON GENERAL 
NON-INTERACTING SYSTEMS 

Systems consisting of non-interacting particles or other 
types of non-interacting sub-systems are popular models 
in many different contexts. Also the canonical system- 
plus-bath setup from the previous section is of this struc- 
ture. Strictly speaking, every single sub-system would 
thus be isolated and its energy would be a conserved 
quantity, thus prohibiting any kind of equilibration or 
thermalization process between different sub-systems. To 
each of them, the discussion of thermalization and and 
the concomitant open questions from Sect. IVIIII apply. 
In particular, individual sub-systems which are "small" 
(e.g. single particles) may not even exhibit equilibration 
(cf. Sect. 13}. Accordingly, the term "non- interacting" 
actually means an interaction which is strictly speaking 
finite, thus giving rise to "normal" equilibration and ther- 
malization, but in certain other respects still "negligibly 
small" . A more precise formulation of such a weak cou- 
pling condition and its implications analogously to Sect. 
IIXI is straightforward. 

Similarly as in Sect. IIXC H one finds that the general- 
ized non-resonance condition (|52|) is violated at zero cou- 
pling 13] but is generic-ally restored as soon as the slight- 
est interaction between the sub-systems is included. Note 
that in Sect. IIXC II it is tacitly assumed that the two 
non-interacting sub-systems are distinguishable. In the 
opposite case of indistinguishable sub-systems (e.g. in- 
distinguishable particles) , all admitted states of the com- 
pound system must be symmetric (Bosonic sub-systems) 
or anti-symmetric (Fermionic sub-systems) against ex- 
changing the indices of the two sub-systems, i.e. the 
pair (n, m) has to be identified with (is indistinguishable 
from) (m,n) for all n,m in (|86[) . (J87I) . Accordingly, in 
the identity E nn — E nm = E mn — E mm discussed below 
(88), the two energies E nm and E mn must be identified. 
Yet, condition (88) is still violated. 

These considerations demonstrate that the generalized 
non-resonance condition (1521) is a quite sensitive criterion 
in the context of equilibration and thermalization. It thus 
seem likely that a significantly weaker but still completely 
general condition of this type may not exist. 

Note that a system of strictly non-interacting particles 
(or other kinds of sub-systems) which is coupled to a bath 
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(canonical setup) generically gives rise to a total system- 
plus-bath compound which cannot be decomposed into 
isolated sub-systems any more, and therefore generically 
fulfills condition ([52]). 

Beyond the realm of "weak coupling" , a partitioning 
of the total compound into physically meaningful sub- 
systems becomes questionable. In particular, it does not 
make much sense to speak about properties of one "sub- 
system alone". Rather, the total compound amounts 
to an isolates system without any special properties, to 
which the general discussion form Sect IVIlil applies. 



XI. THERMALIZATION FOR THE 
CANONICAL SETUP 

The objective of the present section is to establish ther- 
malization without making use of the unproven prop- 
erty (|69p in the most important special case, namely the 
canonical setup from Sect. IIXI 

As detailed in Sect. IVIIIl it is taken for granted that 
the system energy 



E* := (H) 



(98) 



is known with high accuracy. In other words, the en- 
ergy density p{E) from (|46|) exhibits a very narrow peak 
within a close vicinity of E* . By combining (|46| and (|98|) 
one recovers the expected relation 



E* 



dEEp(E) 



(99) 



The "star" in E* emphasises the fact that the ensemble 
averaged energy of the "real" system is fixed. Identifying 
E in Sect. IPC C 31 with E* establishes the connection be- 
tween the so far unrelated considerations in Sect. IIX C 31 
and our present issue of thermalization. 

Formally, the task is to show that in ([55]) the unknown 
details of p n are (practically) irrelevant. In particular, 
this then implies that (j66|) indeed agrees with the predic- 
tion of equilibrium Statistical Mechanics. 

As detailed in Sect. 11X1 the canonical setup consists 
of a "small" system (S) which is weakly coupled to a 
much "bigger" bath (B). Equilibration of the isolated 
system-plus-bath compound is taken for granted, i.e. ex- 
pectation values (32) become practically indistinguish- 
able from (|55|. Focusing on system observables of the 
form (|74|) and observing (|76|) and ([89]) , those equilibrium 
expectation values thus take the form 



(A) - Tr s {p% A 3 } 

^Pn \ n )ss(n\ 



Pc„ 



(100) 
(101) 



where Trg indicates the trace in T-Ls- In other words, as 
far as system properties are concerned, the knowledge of 
the reduced equilibrium density operator pf : Hs —> T~is 
is sufficient. 



A. Boltzmann-form of p„ and canonical density 

Exploiting (|93)) we can conclude that 

Pn= [ dE £§) E^-^™) ( 102 ) 

where the delta-function is, as usual, considered as 
washed out. The first term under the integral can be 
rewritten by means of (1141) 

(103) 



P{E) = p(E) kBT{E) 



u(E) 



Q{E) 



Turning to the second term under the integral, we exploit 
(|57j) . the definition of ojb(E) analogous to (fl"2"|) . and the 
corresponding relation (|14p , yielding 



E S(E - E nm ) =J2 5 ( E - E n- E n 



= w B {E-E>) = 



s ,_ Q B (E-E^) 



k B T B (E-ES) 



(104) 



Making use of flTUJ), (fl03|) . and (|T04l) we can rewrite (fT02|) 
as 

p s n = [ dE p{E)—^—,^ (105) 



Q(E) 



Tb(E~ES) 
S B (E - E s n ) - S(E) 



(106) 



where the dependence of Q on n has been dropped. Ex- 
ploiting ([9"5"j) and (|9"6"|) we see that 

k B Q(E) = S B {E B {E) + E S {E) - E*) 

-S B {E B {E))-S S {E S {E)). (107) 

According to the mean value theorem there exists for any 
given x-and y-value at?6 [0, 1] with the property that 

S B {x + y) = S B {x)+yS l B {x + dy) (108) 

Choosing x = E B (E) and y = Es(E)—E^ and exploiting 
([13]) . we can rewrite (|107l) as 

_ E S (E)-E* S S (E S (E)) 

Q{E) " k B T B (E B (E) + A) fc^ (1 ° 9) 

A := d [E S (E) - E%] , (110) 

where the dependence of & and A on £ and n has been 
dropped. 

We first consider the simplest case of a delta- 
distributed energy density 



p{E) =S(E-E*) 

Thus, (fTUS]) takes the form 

s T(E*) , 

Pn - T B (E*-ES) e 



>(£*) 



(111) 



(112) 
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Observing (1951) . the denominator Tb{E* — E%) can first 
be rewritten as T B {E B {E*) + E S {E*) - E°) and then 
with (fTTjl) as 



T B {E*-E s n )=T B {E B {E*)) \^ + ° ( %%*)-% 

(113) 
Finally, with (J9TJ) and relations like in ([15]) but with in- 
dices S and B we can conclude that 



T B (E* -E%) =T{E* 



1 + 



fs E S (E*) - £ff 

f B E s (E*)-El 



(114) 

In view of (|70p , the last summand is negligible and (|112[) 
takes the form 



Vl = e^ 



(115) 



Similarly as in (J114I) . one sees that T B (E B {E) + A) ap- 
pearing in (|109|) can be approximated by T B (E B (E)) = 
T(E), yielding 

am = ^;;-5 - ^m . (116) 



k B T(E* 



fcs 



With the usual definitions of the free energy Fs(E) and 
the partition sum Zs{E) of the system 5*, namely 

f s (£) := £ s (£)-r s (£)s s (£) (117) 

Z S (£) := e -Fs(E)/k B Ts{E) (n8) 

in combination with (116), we can rewrite (115) as 
1 



P„ 



Z S (E*) 



-E^/k B T(E* 



(119) 



Taking into account the normalization condition 
YlnPn = 1) we recover 



Z s (E*)=J2 e ~ EfJkBT(En 



(120) 



We finally turn to general energy densities p(E). Since 
p{E) enters linearly in (|105|) , we simply can superimpose 
the results (|119[) for sufficiently many delta-functions ap- 
proximating the true p(E). The fact that each delta- 
function brings along a somewhat different value of E* 
in (|119[) has a negligible effect according to (TT6"j) as long 
as p{E) is still sharply peaked about its mean value. De- 
noting this mean value, in accordance with (|98l) and (IM1) , 
again by the symbol E*, one thus recovers exactly the 
same relations as in (|117[) - (|120j) . 

In summary, the canonical formalism (|117p - (|120j) is 
valid in full generality. Apart from the average energy 
E* , all the remaining details of the (unknown) energy 
density p(E) do not matter. The Boltzmann-distribution 
(|119p together with (|101[) yields the canonical density op- 
erator 



Peq 



1 



Z S {E* 



-Hs/k B T(E*) 



(121) 



where T(E*) is the temperature corresponding to the 
given total energy E* of the isolated system-plus-bath 
compound. In practice, this energy E* is usually not 
known, and one thus rather considers the temperature T 
as "given". Accordingly, in (|119p - (ll2ip the state func- 
tion T(E*) is replaced by the "new" independent state 
variable T and similarly Z S (E*) by Z(T) := Z S (E*(T)). 



XII. SUMMARY AND CONCLUSIONS 

In the first part of this paper we considered general, 
isolated quantum systems with many degrees of freedom 
/, and being extensive in the sense of Eqs. (fTTj) and 
(fT5"l) . As a further "generic" property of the Hamilto- 
nian H, the (generalized) non-resonance condition (|52p 
was taken for granted. Our key assumptions concern- 
ing the "realistic modeling" of actual experimental sys- 
tems were: (i) observables have a "reasonably bound" 
range-to-resolution ratio and (ii) initial conditions may 
be arbitrarily out of equilibrium but, on the average over 
the entire statistical ensemble (many repetitions of the 
"same" experiment), they give rise to a well-defined pop- 
ulation density (average occupation probability of many 
neighboring energy levels). The latter assumption seems 
quite plausible per se, but can also be justified via the 
experimental preparation procedure at the origin of the 
initial condition. 

All further "details" of the initial condition and the 
Hamiltonian were left unspecified, reflecting the unavoid- 
able actual lack of knowledge in this respect. 

Given the initial condition, the exact standard Quan- 
tum Mechanical time evolution was adopted without any 
approximation or modification. 

Our first main result (|64p implies that after initial 
transients have died out, the system looks for all prac- 
tical purposes as if it were in a steady state described 
by the so-called generalized Gibbs ensemble p eq , in spite 
of the fact that the "true" density operator p(t) never 
becomes stationary, but rather exhibits the well-known 
Quantum Mechanical recurrence and time inversion in- 
variance properties. Our key conclusion was that the 
mathematically undeniable differences between the "ap- 
parent equilibrium" p eq and the "true" density operator 
p(t) are either unobservably small or unobservably rare 
in time. 

While the issue of equilibration can thus be considered 
as settled, that of thcrmalization still remains an open 
problem as far as completely general isolated systems are 
concerned, as detailed in Sect. I Villi 

In the second part of the paper we focused on a special 
case of foremost practical relevance, namely the canonical 
setup, consisting of a system of actual interest (that may 
be macroscopic or not) which is weakly coupled to a much 
"bigger" environment. Provided the total system-plus- 
bath compound satisfies the above conditions for equi- 
libration, the corresponding "apparent equilibrium" p eq 
reduces, after eliminating (tracing out) the bath, to the 
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canonical density operator (|121|) . independently of all the 
unknown "microscopic details" of the possibly far from 
equilibrium initial condition. In other words, the "small" 
system is proven to exhibit "thermalization" . The result 
even goes beyond the claim of Statistical Mechanics in 
so far as not only the system but also the bath may be 
initially out of equilibrium. 

It seems not unlikely that our main prerequisites in 
deriving these results cannot be substantially weakened 
any more: Systems which can be decomposed into strictly 
non-interacting sub-units (e.g. non-interacting particles) 
are known not to thermalize, and indeed violate the non- 
resonance condition (|52p . Likewise, when either an un- 
limited range-to-resolution ratio or an initial condition 
without a well-dehned population density is admitted, 
one readily finds examples which do not exhibit equili- 
bration, see Sects. Mil and IV Fl 

While our main focus has been on statistical ensembles, 
we have noticed in Sect. IVIICI that all the above results 
also remain valid for pure states \ip(t)), provided the ini- 
tial energy level occupation probabilities |(-0(O)|n)| 2 sat- 
isfy the condition that a well-defined population density 
exists. Specifically, within the canonical setup the re- 
duced system density operator will again be practically 
indistinguishable from the canonical ensemble for the 
overwhelming majority of times t. This result is closely 
related to the issue of "canonical typicality" from Refs. 

iiiiiiiniEizi- 

Next, we briefly address the issue of low temperatures. 
First, for extremely low temperatures, the rough esti- 
mates from dTTJ) and (fT5|) may break down. Since these 
estimates are at the heart of our present approach, also 
our main results may not be valid any more. Essentially 
this happens when the entropy becomes experimentally 
indistinguishable from zero. This may, but need not be 
the case for Bose-Einstein condensates jl4| . 

Further, the common notion that a Bose-Einstein con- 
densate exhibits a macroscopically populated ground 
state may be easily misunderstood in our present con- 
text. Namely, this notion refers to the fact that the total 
many particle product state contains a large number of 
single particle ground states. The word ground state thus 
refers to the individual (non-interacting) particles, not to 
the total many particle system. Indeed, besides the nu- 
merous single particles in their individual ground state, 
there may still remain many further particles which are 
in excited single particle states. Hence, we are in fact not 
dealing with the actual ground state of the many particle 
product Hilbert space. Rather, it may easily happen that 
the maximal population of all the many particle product 
states is still small and hence the conditions regarding 
the level populations from Sect. IVlmav still be satisfied. 

We close with a few remarks regarding the issues of 
(non-)integrability, ergodicity, chaos, decoherence, and 
entanglement. We first remark that (non-)integrability, 
ergodicity, and chaos are relatively well defined notions 
for classical systems, but that their role with respect to 
equilibration and thermalization is not really clear in the 



classical case. The corresponding notions in the realm of 
quantum systems are much less well and uniquely defined 
2, 33]. But even if this problem would be solved, in 
view of the classical situation, the usefulness of those 
concepts for equilibration and thermalization are likely 
to be limited in the quantum case as well. 

The closest connection of our present approach to the 
above notions may be via the non-resonance condition 
(f52"|) . However, such a connection or analogy does not 
seem to offer any additional physical insight. Concerning 
our requirement that the initial condition must exhibit a 
well defined population density, we remark that during 
the preparation phase (which represents the physical ori- 
gin of the initial condition), a distinction between inte- 
grable and non-integrable systems does not make much 
sense anyhow. 

The concepts of integrability, ergodicity and the like 
may well play a crucial role for the following two issues: 
(i) The transient relaxation-process of the initial state 
towards equilibrium, both qualitatively (exponential de- 
cay or not) and quantitatively (estimating the relaxation 
time). This is suggested by the well-established role of 
level statistic in quantum chaos and the importance of 
energy differences throughout the present Appendix D. 
(ii) The general problem of thermalization addressed in 
Sect. I Villi in particular the unproven key postulate ([55)1 
in this context. 

As far as the issue of equilibration is concerned, our 
present approach and results demonstrate that entangle- 
ment and decoherence play no role since we are dealing 
with isolated systems without any external influence by 
the rest of the world. With respect to thermalization, 
the question remains open. 



A. Comparison with related works 

We first address several works with a finite but not 
too small overlap with the present one and then turn to 
the two most closely related Refs. [l^aLUl- The pertinent 
literature regarding the "missing link" (|69| in the context 
of thermalization has already been addressed at the end 
of Sect. IVHll 

Considering and estimating quantities like (|59|) is very 
natural and has a long tradition: Merits and shortcom- 
ings of the early works are reviewed e.g. in [5j, most no- 
tably Ludwig's approach [3j. In particular, many of them 
[J, Q involve an extra average over initial conditions with 
the effect that any specific non-equilibrium initial condi- 
tion (representing a given experiment) must be excluded 
as "potentially untypical" from the general conclusions. 
Turning to the more recent precursors, Peres' approach 
Q is roughly comparable to ours up to Eq. (|149[) but 
then proceeds with the conjecture that the A mn are pseu- 
dorandom matrix elements, statistically independent of 
the p nm , for which there are general arguments [6| and 
numerical evidence [32[ (and counter-evidence [2|) but no 
proof. For pure states, Srednicki obtained similar results 



18 



Q by exploiting a common conjecture about the semi- 
classical behavior of classically smooth observables A in 
systems with a fully chaotic classical limit. Again, this 
conjecture is based on good arguments 30] but no proof. 
Moreover, typical classical many-body systems are not 
expected to behave fully chaotic [23, |38[ ■ Somewhat sim- 
ilar conclusion have been reached even earlier by Deutsch 
[8| via additional hand waving arguments. Finally, rig- 
orous results comparable to (|63|) are due to [l(J, LL6| , but 
only for rather special Hamiltonians H and initial condi- 
tions. Within the same restrictions, the ground breaking 
work by Tasaki [lOj also addresses the issue of thermal- 
ization by arguments which are similar in spirit to those 
in our present Sect. 



The first part of the present work (until the end of Sect. 
IVIIip represents a generalization and more detailed ex- 
planation of the Letter [12J . The main extensions consist 
in the enlarged class of observables admitted in Sect. [TV] 
and the fact that degenerate energy eigenvalues are not 
any more excluded in our present work. 



We finally turn to the closely related work 13]. In con- 
trast to our present work, Ref. [l3j is focused on Hilbert 
spaces which are finite dimensional and which exhibit a 
"system-plus-bath" product structure of the form ([72). 
Apart from a non-resonance condition (excluding degen- 
eracies, see below ([52]) ). the Hamiltonian may still be 
completely arbitrary. Further, the system is assumed to 
be in a pure state on the total system-plus-bath Hilbert 
space, while the obtained results mainly concern the re- 
duced (usually mixed) state of the "small" system after 
tracing out the "large" bath. Apart from these quite 
significant overall differences, the main findings with re- 
spect to equilibration are rather similar in character to 
ours. In particular, observables with finite range are im- 
plicitly taken for granted according to the discussion be- 
low Eq. (3) in Ref. [la ], and the "effective dimension" 
d e ff from [13| is basically equivalent to Tr{p^ } in our 



present approach, as discussed below (|60|) . With respect 
to thermalization, the results from [l3| are of a quite dif- 
ferent character than ours, mostly concerning "typical" 
|22l l2q . |37| properties of pure states which are randomly 
sampled according to a uniform probability density from 
certain sub-Hilbert spaces. In the opinion of the present 
author (see also Sects. HI and IVBJ) . the main open ques- 
tion of this approach is in how far one particular pure 
state or an ensemble of uniformly distributed pure states 
are suitable to describe a real experimental setup. 
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XIII. APPENDIX A 

In this Appendix we establish the relations (l96l) and 
(liJTl) . As detailed above (|M|) . analogous definitions and 
equations as in (!{?]) - (fTT)) . but with indices "S" and "B", 
hold true and are exploited in the following. 

With the help of ([12")) and (JHTJ) we can conclude that 

nrn n 

= f dE'ujB(E-E')J2 S ( E '- E n) = 

J n 

= J dE'u B {E-E')u s {E') , (122) 

where ^2 nm indicates a summation over all n, m = 
0,1,2,.... It follows from (12) that u s (E') = for 
E' < E§ and similarly that oj b (E - E') = is for 
E' > E — Eq . Taking for granted that the integrand 
u!b(E — E')ujs{E') is a sufficiently smooth function of 
E' , it thus will have an absolute maximum in the inte- 
rior of the interval [Eq , E — Eq]. For any given E- value 
exceeding the ground state energy E 00 = Eq + Eq of the 
compound system, the absolute maximum will further- 
more be generically unique. 

Next we rewrite (|122l) by means of (fTTj) and (J14I) as 



e S(E)/k B 

k B T(E) 



e [S B {E')+S s {E-E')]/k B 

dE' ; ; ; (123) 

k B T B {E')k B T s {E - E>) V ' 



Similarly to the integrand in (|122[) , the exponent in (|123[) 
generically exhibits a unique absolute maximum at some 
E'- value in the interior of [Eq , E — Eq], henceforth de- 
noted as E B (E). The next step is to evaluate (|123[) by 
means of a saddle point approximation, i.e. by expand- 
ing the exponent around its maximum up to the second 
order. At the maximum, the derivative of the exponent 
vanishes, yielding with (|13l) the relation 



T B (E B (E))=T S (E S (E)) 
where we have introduced 

E S {E) := E - E B {E) 



(124) 



(125) 



We emphasise that, like in Sects. Ill Al and IIX C 31 we 
avoid speaking about system states. If we give up this 
viewpoint for a second, then (|124[) is nothing else than 
the so-called equilibrium condition for two systems with 
negligible (but non-zero) interaction in thermal equilib- 
rium, sharing the total system energy E according to 
CH). 

Expanding the exponent on the right hand side of 
(|123p up to the second order about its maximum at 
E' = E B (E) in combination with (|17p yields 

S B (E') + S S (E - E') S B (E B (E)) + S S (E S (E)) 



- o[f L 



E' -E B {E) 



-I 2 



Eb(E)-EB 



+ fs 



E'-E B {E) 



E S {E)-E$ 
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Since f B is at least of the order of 10 23 (see below ((70)) ). it 
follows that only an extremely small neighborhood of the 
maximum notably contributes to the integral in (|123|) . 
and within this neighborhood the variations of the non- 
exponential factors on the right hand side of (|123[) are 
negligibly small according to (fl"6[). Performing the re- 
maining Gaussian integral in (|123l) yields 



S(E) = S B (E B (E)) + S S (E S (E)) 
k B k B 

R = ^k B T(E)/[k B T B (E B (E))] 2 (128) 

Ib fs 



ln(O(i?)0l27) 

(128) 
(129) 



[E B (E)-E*\* [E S {E)-E$f 

From (TT5"]) and (|7ip one can infer that R is of the order 
of /. Considering that the quantity in (|127[) scales like 
/ according to (fT5")) and that / is at least of the order 
of 10 23 , the contribution of hx(0(R)) in (fP27) is negli- 
gible. We thus obtain in extremely good approximation 
the relation 



S(E) = S B (E B (E)) + S S (E S (E)) 



(130) 



Differentiating this relation with respect to E and taking 
into account ([T3]). (fT24|) , and ([125} yields 



T(£) = T B {E B (E)) = T S (E S (E)) 



(131) 



The latter two relations are identical to (|96|) and (|97|) in 
the main text 



XIV. APPENDIX B 

In this Appendix we derive the relation 

5> ? 2 m (0) < max;p £ „ . (132) 

n 

To do so, we first exploit f[25|) to conclude 



pI 



„ = 2J 2_/ Pmm(0)p m 'm'(0) 

E m =E n E' m =E„ 

E m =E n 



(133) 



and hence 

n E„ E m =E n 

^ X]pL < y^PB„ m axpB m ■ (134) 

With ([H we obtain ([i"32]) . 



XV. APPENDIX C 

In this Appendix, the derivation of (|48|) is provided. 
To start with, we recall that the delta-functions in (|4f))) 



and (1471) are understood to be "washed out" over many 
energy levels. Hence, smoothening p{E) with the help of 
yet another washed out delta-function actually does not 
change p{E) any more: 



p{E) = f dE' 8{E - E') p{E) 



(135) 



Introducing (14"51 and (|4T|) on the right hand side yields 
p{E) = f dE' S(E - E')Y,h(En)S(E' - E n ) 

J n 

+ J2 5 Pn5{E - £„) ■ (136) 

n 

Since the delta-functions are washed out, the last sum- 
mand essentially amounts to a local average over many 
Sp n and is thus negligible (see below (|4"B")l ). In turn, the 
function h(E) hardly changes within the peak region of 
the delta-functions. Hence h(E n ) can be replaced by 
h(E') and then by h(E). Altogether we thus obtain 

p(E) = h(E) I dE' S(E - E') J2 S(E' - E n ) (.137) 

J n 

The sum can be identified with ui(E') from (fl"2|) . Since 
the latter is once again already a locally averaged quan- 
tity, the integral over the last remaining delta-function is 
trivial, yielding (|48)) . 



XVI. APPENDIX D 



According to ([54]) it follows that 



1 for a = 



e iat _ o for a ^ . (138) 

Next we exploit (J54I) and (l22l to conclude that 

p(<) = X)pm„(0) e -<[^-B»]*A |m)(n| . (139) 



In combination with (J27I) and (jl38[) we thus recover f|55[) . 
With (|32) and (|58l) we can rewrite the variance from 
(1591 as 



exj = [Tr{pft)A| - Tr{p e ^}] 2 

= JT^pp (140) 

p(t) := p(t) - Peq . (141) 

Exploiting ([22J). (|2"7|i . and (J33J) we obtain 

fli) = E'pmn e -*^-»- B -]*/'* | TO )( n | , (142) 

where J2mn indicates a summation over all m,n — 
0, 1,2... with E m ^ Z? n , and where we adopted the ab- 
breviation 



Pmn ■ — Pmn\y) 



(143) 
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It follows that Tr{/5(£)} = and hence 

Tr{p(t)(A + l H c)} = Tr{p(t)A} (144) 

for any c £ R, where 1-^ is the identity on T~L. 

As said below (|33p . we can and will replace 'H and ^4 
by 1-L+ and A + in the rest of this Appendix. Introducing 



A := A + - l n min(ip\A\ip} 



H-i 



(145) 



we can infer from (|55|) that 
< {i>\A\ip) < A^ for all normalized \ip) e H+ . (146) 



Taking into account (|144[) and (|145|l . the variance (|140|l 
can be rewritten as 



<7J = [Tr{p(t)i}]2 . (147) 

Introducing (|142|) into (|147j) we obtain 



<d = E ^PkjAnnPnm ^-E^E^E^t/H ? (M8) 

where J2jkmn indicates a summation over all j, k,m,n = 
0, 1, 2... with #j ^ E k and £ m 7^ £„. 

Since, according to (|138[) , the time averaged exponen- 
tials in (|148p vanish if £j — Ek + -Em - K ^ we can 
conclude from the non-resonance condition (|52p that 



«d = EV 



mn\ \Hmn 
ran ran 



l 2 \Prr,J 2 < 



^2\A mn \ 2 \p mn \ 2 , (149) 



where the first sum runs over all m, n with E m 7^ E n 
and the second over all m, n. With (|24|) and (|55|) we thus 
obtain 



<d < 



/ J -^mnPnn-^nmPmrn 
mn 

y^(m[ Ap e g I ra) (n I Ap eq 



'") 



(150) 



The sum over n amounts to an identity operator and that 
over m yields 



a 2 =Tr{[i Pe9 ] 2 }. 



(151) 



Next, we evaluate this trace with the help of the eigen- 
vectors \xn) of A, yielding 



<d ^ 22{Xm\PeqA\Xn){Xn\PegA\ 



Xr, 



(152) 



Observing that_A|xn) = Wn) {X n\A\ Xn) (since |x„) is 
eigenvector of A) we can exploit (|146[) to obtain 



o\ < ( A A) 2 ^2{Xm\peq\Xn){Xn\peq\Xr> 



(153) 



The sum over n yields the identity operator and that over 
m amounts to Tr{/9 2 q }, yielding 



al<(A^Tr{pl q }. 



(154) 



Finally, we note that according to (|56| and (|141[) we 
can subtract from A in (|140[) an arbitrary function B(h) 
of the form ([39| with the only consequence in the final 
result (|154p that A^ goes over into A' A _ B , h y Since this 
conclusion holds for arbitrary -B(b), the inequality even 
remains true after minimization over all B(h), i.e. we can 
replace A^ in (|154|) by A^ from (J4TJ) . In other words, we 
recover (|6T))) . 



XVII. APPENDIX E 

The purpose of this Appendix is a more detailed jus- 
tification of the non-degeneracy assumption adopted in 
Eq. (|75|) . More precisely, we will argue that the energy 
levels E n (X), considered as functions of A, do not cross 
each other for generic Hamiltonians H(X). 

Intuitively, one expects that the E n (X) will be highly 
non-trivial functions of A and that for different n these 
functions will behave notably "different" from each other. 
Since the energy levels are unimaginably dense (cf. Sect. 
Ill Ap . crossings of neighboring levels E n (\) upon varia- 
tion of A thus might seem to be almost unavoidable. 

But closer inspection shows that, on the contrary, such 
level crossings are actually avoided for generic Hamilto- 
nians H(X) due to the so-called level-repulsion mecha- 
nism: The levels E n (X), considered as functions of A, 
are governed by the following exact evolution equation, 
originally due to Pechukas and Yukawa |39j |: 



dX 2 



E n (X) = 2^ 



\V nm (X)\< 



rn^n 



E n (X) — E m (X) 



*nm(.X) 



(n(A)|— ff(A)|m(A)) 



(155) 



(156) 



where E n (X) and |n(A)) are the "accompanying" eigen- 
values and eigenvectors of H(X). The closing evolution 
equations for V nm (X) are also known, but not explicitly 
needed for our purpose. The main point is that looking 
upon A as "time" and E n (A) as "particle positions" , Eq. 
(|155p is nothing else than the Newtonian equation of mo- 
tion for a one-dimensional "gas" of point particles, the 
so-called Pechukas- Yukawa gas [30|. The particles are 
repelling each other with "coupling strengths" |Ki m (A)| 2 
which depend on "time" A. In the generic case (no spe- 
cial symmetries or "selection rules"), the coupling of two 
neighboring particles, say |H m +i(A)| 2 , will be positive 
with the exception of at most a discrete set of time- 
points A, and as a consequence, any "attempt" of the 
two neighboring levels E n (X) and E n+ i(X) to cross each 
other is inhibited by a repulsive "force" term in (|155[) of 
the form \V nn+ i(X)\ 2 [E n (X) — E n+ i(X)]~ l which diverges 
as E n (X) ->E n+1 (\). 
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